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Abstract 

The  outline  of  a  unified  theory  of  local  pragmatics  phenomena  is 
presented,  including  an  approach  to  the  problems  of  reference  resolu¬ 
tion,  metonymy,  and  interpreting  nominal  compounds.  The  TACITUS 
computer  system  embodying  this  theory  is  also  described.  The  theory 
and  system  are  based  on  the  use  of  a  theorem  prover  to  draw  the  ap¬ 
propriate  inferences  from  a  large  knowledge  base  of  commonsense  and 
technical  knowledge.  Issues  of  control  are  discussed.  Two  important 
kinds  of  implicatures  are  defined,  and  it  is  shown  how  they  can  be  used 
to  determine  what  in  a  text  is  given  and  what  is  new. 


1  The  Problems 

In  the  messages  about  breakdowns  in  machinery  that  are  being  processed  by 
the  TACITUS  system  at  SRI  Internationa],  we  find  the  following  sentence: 

(1)  We  disengaged  the  compressor  after  the  lube  oil  alarm. 

This  sentence,  like  virtually  every  sentence  in  natural  language  discourse, 
confronts  us  with  difficult  problems  of  interpretation.  First,  there  are  the 
reference  problems;  what  do  “the  compressor”  and  “the  lube  oil  alarm” 
refer  to.  Then  there  is  the  problem  of  interpreting  the  implicit  relation 
between  the  two  nouns  “lube  oil”  (considered  as  a  multiword)  and  “alarm” 
in  the  nominal  compound  “lube  oil  alarm”.  There  is  also  a  metonymy  that 
needs  to  be  expanded.  An  alarm  is  a  physical  object,  but  “after”  requires 
events  for  its  arguments.  We  need  to  coerce  “the  lube  oil  alarm”  into  “the 
sounding  of  the  lube  oil  alarm”.1  There  is  the  syntactic  ambiguity  problem 

JOne  could  say  that  “alarm”  in  this  sentence  means  the  event  of  “alarming”,  so  that 
there  is  no  metonymy.  If  we  took  this  approach,  however,  there  would  be  a  lexical  ambi- 
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of  whether  to  attach  the  prepositional  phrase  “after  the  lube  oil  alarm”  to 
“the  compressor”  or  to  “disengaged”. 

All  of  these  problems  we  have  come  to  call  problems  in  “local  pragmat¬ 
ics”.  Local  pragmatics  encompasses  reference  resolution,  metonymy,  the  in¬ 
terpretation  of  nominal  compounds  and  other  implicit  and  vague  predicates, 
and  the  resolution  of  syntactic,  lexical,  and  quantifier  scope  ambiguities.  It 
may  be  that  to  solve  these  problems,  we  need  to  look  at  the  surrounding  dis¬ 
course  and  the  context  in  which  the  utterance  is  made.  But  we  can  determine 
locally — just  from  the  sentence  itself — that  we  have  a  problem.  They  seem 
to  be  specifically  linguistic  problems,  but  the  traditional  linguistic  methods 
in  syntax  and  semantics  have  not  yielded  solutions  of  any  generality. 

The  difficulty,  as  is  well-known,  is  that  to  solve  these  problems  we  need 
to  use  a  great  deal  of  arbitrarily  detailed  general  commonsense  and  domain- 
specific  technical  knowledge.  In  sentence  (1)  we  need  to  know,  for  example, 
that  the  compressor  has  a  lube  oil  system,  which  has  an  alarm,  which  sounds 
when  the  pressure  of  the  lube  oil  drops  too  low.  We  need  to  know  that 
disengaging  and  sounding  are  events,  and  that  a  compressor  isn’t. 

A  theory  of  local  pragmatics  phenomena  must  therefore  be  a  theory 
about  how  knowledge  is  used.  The  aim  of  our  research  has  been  to  develop 
a  unified  theory  of  local  pragmatics,  based  on  the  drawing  of  appropriate 
inferences  from  a  large  knowledge  base,  and  to  implement  a  system  embody¬ 
ing  that  theory  for  solving  local  pragmatics  problems  in  naturally  occurring 
texts.  It  is  our  intention  that  in  this  theory  general  solutions  to  local  prag¬ 
matics  problems  can  be  characterized,  but  it  should  also  be  possible  to  cast 
current,  limited  approaches  to  these  phenomena  as  special  cases  of  the  gen¬ 
eral  solutions. 

This  research  is  taking  place  in  the  context  of  the  TACITUS  project,2 
the  specific  aim  of  which  is  to  develop  interpretation  processes  for  handling 
casualty  reports  (casreps),  which  are  messages  in  free-flowing  text  about 
breakdowns  in  mechanical  devices.  More  broadly,  however,  its  aim  is  to 
develop  general  procedures,  together  with  the  underlying  theory,  for  us¬ 
ing  commonsense  and  technical  knowledge  in  the  interpretation  of  written 
(and  spoken)  discourse  regardless  of  domain.  We  expect  such  interpretation 
processes  to  constitute  an  essential  component,  and  indeed  the  principal 

guity  problem  of  deriding  which  sense  of  “alarm”  is  being  used,  and  the  processing  saved 
on  metonymy  would  be  used  up  by  the  correspondingly  more  difficult  nominal  compound 
problem . 

2  A  part  of  the  Strategic  Computing  program  sponsored  by  the  Defense  Advanced 
Research  Projects  Agency. 
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component,  in  sophisticated  natural  language  systems  of  the  future. 

The  TACITUS  system  has  four  principal  components.  First,  a  syntactic 
front-end,  the  DIALOGIC  system  (Grosz  et  al.,  1982),  translates  sentences 
of  a  text  into  a  logical  form  in  first-order  predicate  calculus,  described  in 
Section  3.1.  Second,  we  are  building  a  knowledge  base,  specifying  large 
portions  of  potentially  relevant  knowledge  encoded  as  predicate  calculus 
axioms  (Hobbs  et  al.,  1986).  Third,  the  TACITUS  system  makes  use  of  the 
KADS  theorem  prover,  developed  by  Mark  Stickel  (Stickel,  1982).  Finally, 
there  is  the  pragmatics  component,  which  uses  the  theorem  prover  to  draw 
appropriate  inferences  from  the  knowledge  base,  thereby  constructing  an 
interpretation  of  the  text.  At  the  present  time,  the  pragmatics  component 
deals  only  with  local  pragmatics,  and  what  it  does  is  the  subject  of  this 
paper.  In  addition,  however,  we  are  beginning  to  augment  the  pragmatics 
component  with  procedures  for  relating  the  text  to  the  user’s  interests,  and 
we  plan  to  augment  it  with  procedures  for  recognizing  discourse  structure. 

Section  2  describes  the  three  local  pragmatics  problems  we  are  currently 
devoting  our  efforts  to.  The  solutions  to  each  of  them  requires  constructing 
and  proving  a  particular  logical  expression.  In  Section  3  we  discuss  how 
an  expression — the  interpretation  expression — is  constructed  for  an  entire 
sentence,  such  that  its  proof  constitutes  an  interpretation  of  the  sentence. 
We  also  discuss  how  the  search  for  a  proof  of  this  expression  can  be  ordered. 
Very  often,  interpretation  requires  that  certain  facts  be  assumed,  where  the 
only  warrant  for  the  assumptions  is  that  they  lead  to  a  good  interpretation. 
These  are  called  “implicatures”.  In  Section  4  we  describe  our  current  ap¬ 
proach  to  implicature  and  an  approach  we  are  just  beginning  to  investigate. 
In  Section  5  we  describe  and  illustrate  the  current  implementation. 

2  Local  Pragmatics  Phenomena 

2.1  Interpretation  as  Deduction 

Language  does  not  give  us  meanings.  Rather,  it  gives  us  problems  to  be 
solved  by  reasoning  about  the  sentence,  using  general  knowledge.  We  get 
meaning  only  by  solving  these  problems.  Before  we  can  use  what  is  asserted 
in  a  sentence  to  draw  further  conclusions,  we  must  first  interpret  the  sentence 
by  deducing  its  presuppositions  from  the  knowledge  base. 

Since  knowledge  is  encoded  in  the  TACITUS  system  as  axioms  in  pred¬ 
icate  calculus,  reasoning  about  them,  and  hence  arriving  at  interpretations, 
is  a  matter  of  deduction.  To  interpret  a  sentence,  we  first  determine  from  the 
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sentence  what  interpretation  problems  we  are  required  to  solve,  i.e.,  w'hat 
local  pragmatics  phenomena  are  exhibited.  These  are  framed  as  expressions 
to  be  proved  by  the  deduction  component.  The  proofs  of  these  expressions 
constitute  the  interpretation  of  the  sentence.  Where  there  is  more  than  one 
interpretation,  it  is  because  there  is  more  than  one  proof  for  the  expressions. 

In  this  section,  we  describe  the  three  phenomena  we  are  addressing  first — 
reference,  metonymy,  and  nominal  compounds.  For  each  of  these,  we  de¬ 
scribe  the  expression  that  needs  to  be  proved.  For  the  last  two,  we  describe 
how  current  standard  techniques  can  be  seen  as  special  cases  of  our  general 
approach. 

2.2  Reference 

Entities  are  referred  to  in  discourse  in  many  guises.  They  can  appear  as 
proper  nouns,  definite,  indefinite,  and  bare  noun  phrases  of  varying  speci¬ 
ficity,  pronouns,  and  omitted  or  implicit  arguments.  Moreover,  verbs,  ad¬ 
verbs,  and  adjectives  can  refer  to  events,  conditions,  or  situations.  The 
problem  in  all  of  these  cases  is  to  determine  what  is  being  referred  to.  Here 
we  confine  ourselves  to  definite  noun  phrases,  although  in  Section  4  we  ex¬ 
tend  our  treatment  to  indefinite  and  bare  noun  phrases  and  nonnominal 
reference. 

In  the  sentence 

The  alarm  sounded. 

the  noun  phrase  “the  alarm”  is  definite,  and  the  hearer  is  therefore  expected 
to  be  able  to  identify  a  unique  entity  that  the  speaker  intends  to  refer  to. 
Restating  this  in  theorem-proving  terminology,  the  natural  language  system 
should  be  able  to  prove  constructively  the  expression 

(3  i)a/arm(i) 

That  is,  it  must  find  an  x  which  is  an  alarm  in  the  model  of  the  domain.  If 
it  succeeds,  it  has  solved  the  reference  problem.3 
Similarly,  in  the  text 

(2)  The  compressor  is  down. 

The  air  inlet  valve  is  clogged. 

3  In  this  paper  we  ignore  the  problem  of  the  uniqueness  of  the  entity  referred  to.  A  hint 
of  our  approach  is  this:  If  the  search  for  a  proof  is  heuristically  ordered  by  salience,  then 
the  entity  found  will  be  the  uniquely  most  salient. 
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we  need,  in  interpreting  the  second  sentence,  to  prove  the  existence  of  an  air 
inlet  valve.  We  know  from  the  first  sentence  that  there  is  a  compressor,  and 
our  model  of  the  domain  tells  us  that  compressors  have  air  inlet  valves.  So 
we  can  conclude  that  the  reference  is  to  the  air  inlet  valve  of  that  compressor. 

In  processing  the  casreps  there  is  a  further  wrinkle  in  the  problem — noun 
phrases  rarely  have  determiners,  and  there  is  no  clear  signal  whether  it  is 
definite  or  indefinite.  This  problem  is  dealt  with  in  Section  4. 

2.3  Metonymy 

In  metonymy,  or  indirect  reference,  we  refer  to  one  thing  as  a  way  of  referring 
to  something  related  to  it.  Sentence  (1)  contains  the  phrase  “after  the 
alarm”,  where  what  is  really  meant  is  “after  the  sounding  of  the  alarm”. 
“The  alarm”  is  used  to  refer  to  the  sounding  which  is  related  to  it,  and  in 
interpreting  the  phrase  we  need  to  coerce  the  alarm  to  its  sounding. 

Metonymy  is  extremely  common  in  discourse;  when  examined  closely, 
very  few  sentences  will  be  found  without  an  example.  Certain  functions  very 
frequently  provide  the  required  coercions.  Wholes  are  used  for  parts;  tokens 
are  used  for  types;  people  are  used  for  names.  Nunberg  (1978),  however,  has 
shown  that  there  is  no  finite  set  of  possible  coercion  functions.  The  relation 
between  the  explicit  and  implicit  referents  can  be  virtually  anything. 

From  a  generation  point  of  view,  the  story  behind  metonymy  must  go 
something  like  this:  A  speaker  decides  to  say 

...  A  after(Eo,  Ei)  A  sound'(Ei,A)  A  alarm(A) 

that  is,  Eq  is  after  the  sounding  E\  of  the  alarm  A.  However,  given  the 
first  and  last  predications,  the  middle  one  is  obvious,  and  hence  can  be  left 
out.  Since  after  needs  a  second  argument  and  A  has  to  be  the  argument  of 
something,  after  takes  A  as  its  second  argument,  yielding 

...  A  after(Eo, A)  A  alarm(A) 
or  “after  the  alarm”. 

From  an  interpretation  point  of  view,  the  story  is  this:  Every  morpheme 
in  a  sentence  corresponds  to  a  predication,  and  every  predicate  imposes  se- 
leciional  constraints  on  its  arguments.  Since  entities  in  the  text  are  generally 
the  arguments  of  more  than  one  predicate,  there  could  well  be  inconsistent 
constraints  imposed  on  them  (especially  in  light  of  the  above  generation 
story).  To  eliminate  this  inconsistency,  we  interpose,  as  a  matter  of  course, 
another  entity  and  another  relation  between  any  two  predications.  Thus, 
when  we  encounter  in  the  logical  form  of  a  sentence 
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...  A  after(eo,a)  A  alarm(a) 

we  assume  that  what  is  intended  is  really 

...  A  after(eo,k)  A  rel(k,a)  A  alarm(a) 

for  some  entity  k  and  some  relation  rel.  The  predication  rel(k,a)  functions 
as  a  kind  of  buffer,  or  impedence  match,  between  the  explicit  predications 
with  their  possibly  inconsistent  constraints.  In  many  cases,  of  course,  there  is 
no  inconsistency.  The  argument  satisfies  the  selectional  constraints  imposed 
by  the  predicate.  In  these  cases,  k  is  a  and  rel  is  identity.  This  in  fact  is  the 
first  possibility  tried  in  the  implemented  system.  Where  this  fails,  however, 
the  problem  is  to  find  what  k  and  rel  refer  to,  subject  to  the  constraint, 
imposed  by  the  predicate  after ,  that  A:  is  an  event. 

Therefore,  TACITUS  modifies  the  logical  form  of  the  sentence  to 

...  A  after(ea,k )  A  rel(k,a)  A  alarm(a) 

and  for  an  interpretation,  the  expression  that  must  be  proved  constructively 
is 


(3  k,rel,a)event(k)  A  rel(k,a)  A  alarm(a) 

We  need  to  find  an  event  k  bearing  some  relation  rel  to  the  alarm. 

The  most  common  current  method  for  dealing  with  metonymy,  e.g.,  in 
the  TEAM  system  (Grosz  et  al.,  1985),  is  to  specify  a  small  set  of  possible 
coercion  functions,  such  as  name-of .  This  method  can  be  captured  in  the 
present  framework  by  treating  rel  not  as  a  predicate  variable,  but  as  a 
predicate  constant,  and  expressing  the  possible  coercions  in  axioms  like  the 
following: 

(V x,y)name(x,y)  D  rel(x,y ) 

That  is,  if  x  is  the  name  of  y,  then  y  can  be  coerced  to  x.  This  in  fact  is  the 
method  we  have  implemented  in  our  initial  version  of  the  TACITUS  system. 

2.4  Nominal  Compounds 

To  interpret  a  nominal  compound,  like  “lube  oil  alarm”  (where  “lube  oil” 
is  taken  as  a  multiword),  it  is  necessay  to  discover  the  implicit  relation 
between  the  two  nouns.4  Some  relations  occur  quite  frequently  in  nominal 

‘Some  nominal  compounds  can  of  course  be  treated  as  single  lexical  items.  This  case 
is  not  interesting  and  is  not  considered  here. 
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compounds — part-of ,  location ,  purpose.  Moreover,  when  the  head  noun  is 
relational,  the  modifier  noun  is  often  one  of  the  arguments  of  the  relation. 
Levi  (1978)  argued  that  these  two  cases  encompassed  virtually  all  nominal 
compounds.  However,  Downing  (1977)  and  others  have  shown  that  virtually 
any  relation  can  occur.  A  lube  oil  alarm,  for  example,  is  an  alarm  that 
sounds  when  the  pressure  of  the  lube  oil  drops  too  low. 

To  discover  the  implicit  relation,  one  must  prove  constructively  from  the 
knowledge  base  the  existence  of  some  possible  relation,  which  we  may  call 
nn,  between  the  entities  referred  to  by  the  nouns: 

(3  x,  y)alarm(x)  A  lube-oil(y)  A  nn(y,x) 

Just  as  with  metonymy,  the  most  common  method  for  dealing  with  nom¬ 
inal  compounds5  is  to  hypothesize  a  small  set  of  possible  relations,  such  as 
part-of.  In  our  framework,  we  can  use  this  approach  by  taking  nn  to  be  not 
a  predicate  variable  but  a  predicate  constant,  and  encoding  the  possibilities 
in  axioms  like 

(V x,y)part(x,y)  D  rm(y, x) 

For  example,  if  a  blade  x  is  a  part  of  a  fan  y,  then  “fan  blade"  is  a  possible 
nominal  compound.  Equality  also  implies  an  nn  relation,  for  nominal  com¬ 
pounds  like  “metal  particle”  (an  x  such  that  x  is  metal  and  x  is  a  particle). 

To  deal  with  relational  nouns,  such  as  “oil  sample”  and  “oil  pressure", 
we  encode  axioms  like 

(3)  (V x,y)sample(x,y)  D  nn(y,x) 

This  tells  us  that  if  i  is  a  sample  of  oil  y,  then  x  can  be  referred  to  by  the 
nominal  compound  “oil  sample". 

Finin  (1980)  argues  that  one  of  the  most  common  kinds  of  relations  is 
one  that  involves  the  function  of  the  referent  of  the  head  noun.  The  function 
of  a  pump  is  to  pump  a  fluid,  so  “oil  pump"  is  a  possible  nominal  compound. 
This  can  be  encoded  in  axioms  of  the  pattern 

(Vz,y,  e)  function(e,  x)  A  p'(e,x,y)  D  nn(y,x ) 

That  is,  if  e  is  the  function  of  x  where  e  is  the  situation  of  x  doing  something 
p  to  y,  then  there  is  an  nn  relation  between  y  and  x. 

As  with  metonymy,  in  our  initial  version  of  TACITUS,  it  is  the  standard, 
restricted  method  that  we  have  implemented.  This  is  because  we  wanted 

s  Other  than  treating  them  as  multiwords. 
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to  make  sure  we  were  not  losing  ground  in  seeking  a  general  solution.  Nev¬ 
ertheless,  our  approach  allows  us  to  begin  experimenting  with  the  general 
solution  to  the  nominal  compound  problem,  where  the  implicit  relation  can 
be  anything  at  all. 

3  The  Construction  and  Proof  of  the  Interpreta¬ 
tion  Expression 

3.1  Preliminary  Note  on  Logical  Form 

DIALOGIC,  the  syntactic  front  end  of  TACITUS,  produces  a  logical  form  for 
the  sentence  in  something  like  a  first-order  logic  but  encoding  grammatical 
subordination  relations  as  well  as  predicate-argument  relations.  It  is  “on- 
tologically  promiscuous”  in  that  events  and  conditions  are  reified  (Hobbs, 
1985a).  A  slightly  simplified  version  of  the  logical  form  for  the  sentence 

(4)  The  lube  oil  alarm  sounded, 
is 

(5)  past([e]  |  sound'(ci,[ai  |  alarm(a1)A 

nn([o,  |  /u&e-oiltoi)],^)])]) 

u  |  ”  can  be  read  “such  that”  or  “where”,  so  that  a  paraphrase  of  this  formula 
would  be  “In  the  past  there  was  an  event  ei  which  was  a  sounding  event  by 
aj  where  aj  is  an  alarm  and  there  is  an  nn  relation  between  a\  and  cq  such 
that  o\  is  lube  oil. 

In  general,  the  logical  form  of  a  sentence  is  a  “proposition”.  A  proposi¬ 
tion  is  a  predicate  applied  to  one  or  more  arguments.  An  argument  is  either 
a  variable  or  a  “complex  term”.  A  complex  term  is  a  variable,  followed  by 
a  “such  that”  sign,  followed  by  a  “restriction”.  (Complex  terms  are  sur¬ 
rounded  by  square  brackets  for  readability.)  A  restriction  is  a  conjunction 
of  propositions. 

This  notation  can  be  translated  into  a  notation  using  four-part  quan¬ 
tifier  structures  (Woods,  1977;  Moore,  1981)  by  successively  applying  the 
following  transformation: 

I  ?(*)])  q{x)  v(x)T 

Quantifiers  other  than  existentials  are  ignored  in  this  paper.  For  the  treatment  we 
intend  to  give  them,  see  Bobbs  (1983). 
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It  can  be  translated  into  standard  Russellian  notation,  with  a  consequent  loss 
of  information  about  grammatical  subordination,  by  successively  applying 
the  following  transformation: 

J>([*  I  ?(*)])  =>  p(*)  A  q(x) 

3.2  Order  of  Interpretation 

As  we  saw  in  Section  2,  interpretation  involves  solving  a  number  of  problems, 
or  proving  a  number  of  expressions,  and  this  raises  a  question.  In  which  order 
should  we  try  to  solve  them?  A  naive  answer  would  be  to  try  to  solve  them 
“from  the  inside  out”.  Before  trying  to  find  the  lube  oil  alarm ,  we  should  try 
to  find  the  lube  oil  the  alarm  is  an  alarm  for.  Before  checking  that  the  lube 
oil  alarm  obeys  the  selectional  constraints  imposed  by  “sound”,  we  should 
learn  as  much  as  we  can  about  the  lube  oil  alarm;  in  particular,  we  should 
resolve  the  reference  of  “the  lube  oil  alarm”  so  we  know  what  lube  oil  alarm 
is  being  talked  about. 

This  means  that  given  the  logical  form  (5),  we  should  solve  the  local 
pragmatics  problems  in  the  following  order: 

1.  Find  the  reference  of  o\,  the  lube  oil.  Prove 

(3  0])/u6e-oi/(oi) 

2.  Given  that,  find  the  reference  of  aj,  the  alarm,  and  as  a  by¬ 
product,  find  the  implicit  relation  nn  encoded  in  the  nominal 
compound.  If  o i  was  resolved  to  O,  then  prove 

(3a1)a/arm(ai)  A  nn{a\,0) 

3.  Given  that,  check  the  predicate-argument  congruence  of  sound 
applied  to  aj.  If  aj  was  resolved  to  A  and  sound  requires  its 
argument  to  be  a  physical  object,  then  prove 

(3  k)physical-object(k)  A  rel(k,A) 

Unfortunately,  this  order  will  not  always  work.  Information  relevant  to 
the  solution  of  any  of  these  local  pragmatics  problems  can  come  from  the 
solutions  of  any  of  the  others.  For  example,  in  the  sentence 

This  thing  won’t  work. 

selectional  constraints  imposed  by  “work”  provide  more  information  about 
the  referent  of  “this  thing”  than  the  noun  phrase  itself  does. 
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Thus,  in  a  more  sophisticated  approach,  we  would  construct  a  single 
expression  to  be  proved,  encoding  what  is  required  for  all  of  the  local  prag¬ 
matics  problems.  For  sentence  (4),  the  expression  would  be 

(3  k,ai,nn,oi)physical-object(k)  A  rel(k,a j)  A  alarm(aj) 
Ann(aj,oj)  A  lube-oil(o\ ) 

Let  us  call  this  the  interpretation  expression. 

The  conjuncts  of  the  interpretation  expression  could  be  proved  in  any 
order.  The  inside-out  order  is  only  one  possibility.  The  search  for  a  proof  is  a 
heuristic,  depth-bound,  breadth-first  search,  and  the  inside-out  order  can  be 
taken  as  an  indication  of  how  much  of  its  resources  the  theorem  prover  should 
devote  to  proofs  of  the  various  conjuncts,  and  how  early.  More  resources 
should  be  devoted  earlier  to  the  initial  conjuncts  in  inside-out  order.  But 
other  possible  orders  of  proof  must  be  left  open.  The  difficulty  with  this 
approach,  however,  is  that  it  is  hard  to  get  partial  results  in  cases  of  failure. 

We  are  currently  using  a  compromise  between  these  two  orders — a  fail- 
soft,  inside-out  order.  As  we  proceed  inside  out,  at  each  step  the  theorem- 
prover  is  given  the  full  expression  built  up  to  that  point.  However,  the 
expression  has  as  an  antecedent  the  instantiations  of  what  was  proven  in 
earlier  steps.  Thus,  in  step  3  in  the  example,  the  expression  is 

lube-oil(O)  A  alarm(A)  A  nn(A,0 )  D 
(3 k,ai,Oi)physical-object(k)  A  re/(fc,ai) 

Aalarm(ai)  A  nn(ai,Oj)  A  lube-oil(o} ) 

Those  prior  instantiations  consistent  with  higher  constraints  will  be  proven 
immediately  from  the  antecedent,  and  new  proofs  will  need  to  be  discovered 
only  for  those  which  are  inconsistent.7 

3.3  The  Algorithm  for  Constructing  the  Interpretation  Ex¬ 
pression 

The  required  expression  can  be  constructed  by  a  recursive  procedure  which 
for  convenience  we  will  call  PRAG.  PRAG  is  called  with  a  proposition  and 
a  logical  expression  as  its  two  arguments.  Initially,  PRAG  is  called  with  the 
logical  form  of  the  sentence  as  its  first  argument  and  T  as  its  second.  The 
second  argument  (call  it  expr )  will  be  used  to  build  up  the  interpretation 
expression  for  the  sentence. 

7This  technique  is  due  to  Mark  Stickel. 
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First,  to  handle  the  congruence  requirement  imposed  by  the  predicate 
p  of  the  proposition  on  its  arguments,  if  the  knowledge  base  contains  the 
selectional  constraint 

p(x)  :  r(x) 

i.e.,  that  r  must  be  true  of  x,  then  r(k )  A  rel(k,a )  is  conjoined  to  expr 
where  k  is  a  new  existentially  quantified  variable,  and  the  relevant  part  of 
the  logical  form  is  altered  from  p(a)  to  p(fc)  A  re/(fc,a) 

Next,  each  of  the  arguments  is  processed  in  turn.  To  resolve  reference  for 
an  argument  of  the  form  [a  |  P],  all  of  the  complex  terms  in  P  are  replaced 
by  their  lead  variables  and  the  result  is  conjoined  to  expr. 

Finally,  for  each  of  the  arguments  of  the  proposition,  PRAG  is  called 
recursively  on  all  of  the  conjuncts  in  its  restriction  P  (with  the  original 
complex  terms  in  P  intact),  and  the  results  are  conjoined  to  expr.  PRAG 
returns  the  interpretation  expression  expr. 

3.4  Minimality 

Axioms  can  be  assigned  a  cost,  depending  upon  their  salience.  High  salience, 
low  cost  axioms  would  then  be  tried  first.  Short  proofs  are  naturally  tried 
before  long  proofs.  Thus,  a  cost  depending  on  salience  and  length  is  as¬ 
sociated  wtih  each  proof,  and  hence  with  each  interpretation.  Where,  as 
usually  happens,  there  is  more  than  one  possible  interpretation,  the  better 
interpretations  are  supported  by  less  expensive  proofs. 

The  second  criterion  for  good  interpretations  is  that  we  should  favor 
the  minimal  solution  in  the  sense  that  the  fewest  new  entities  and  relations 
needed  to  be  hypothesized.  For  example,  the  argument -relation  pattern 
in  nominal  compounds,  as  in  “lube  oil  pressure”,  is  minimal  in  that  no 
new  implicit  relation  need  be  hypothesized;  the  one  already  given  by  the 
head  noun  will  do.  In  metonymy,  the  identity  coercion  is  favored  for  the 
same  reason,  and  shorter  coercions  are  favored  over  longer  ones.  Similarly, 
in  the  definite  reference  example  (2),  the  air  inlet  valve  of  the  mentioned 
compressor  is  favored  over  the  air  inlet  valve  of  the  compressor  adjacent  to 
the  mentioned  compressor,  because  of  the  same  minimality  prinicple. 

These  ideas  at  least  give  us  a  start  on  the  very  difficult  problem  of 
choosing  the  best  interpretation. 
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4  Implicatures  and  Abduction 

4.1  Given  and  New,  Definite  and  Indefinite,  Presupposed 
and  Asserted 

When  we  hear  a  sentence,  we  try  to  match  part  of  the  information  it  con¬ 
veys  with  what  we  already  know;  the  rest  is  new  information  we  add  (or 
decide  not  to  add)  to  what  we  know.  In  our  approach  to  reference,  proving 
constructively  from  the  knowledge  base  the  existence  of  a  definite  entity  is 
precisely  the  operation  of  matching  the  definite  noun  phrase  with  what  we 
already  know.  Indefinite  noun  phrases,  by  contrast,  require  us  to  introduce 
anew  entity,  rather  than  find  an  already  existing  entity.  However,  a  problem 
arises  in  the  casreps  that  is  really  just  an  aggravated  form  of  a  problem  that 
arises  generally.  There  are  virtually  no  articles.  Sentence  (1)  was  really 

Disengaged  compressor  after  lube  oil  alarm. 

Consequently,  we  can  almost  never  know  whether  an  entity  is  definite  or 
not.  It  can  go  either  way.  In 

(6)  Metal  particles  in  oil  sample  and  filter. 

the  oil  filter  is  something  we  know  about  already.  It  is  in  our  model  of  the 
device.  “Oil  filter”  is  definite.  On  the  other  hand,  we  are  just  being  told 
that  a  sample  of  the  oil  was  taken.  “Oil  sample”  is  indefinite. 

In  general  discourse,  where  articles  do  occur,  a  problem  still  arises,  since 
definite  articles  are  sometimes  used  where  the  entity  is  not  really  known.  If 
a  speaker  begins  a  sentence  with 

The  trouble  with  John  is  . , . 

it  may  be  that  both  the  speaker  and  hearer  know  John  has  trouble  and  are 
able  to  resolve  the  reference.  Or  it  could  be  that  the  speaker  is  introduc¬ 
ing  for  the  first  time  the  fact  that  there  is  a  problem  with  John.  Related 
examples  and  an  account  of  this  phenomenon  can  be  found  in  Hobbs  (1987). 

At  first  glance,  it  may  seem  that  this  problem  is  compounded  in  our 
ontologically  promiscuous  approach  to  logical  form.  There  are  entities  cor¬ 
responding  to  every  predication  made  by  the  sentence,  for  example,  the  dis¬ 
engaging  in  sentence  (1).  For  each  of  these  entities  we  must  decide  whether 
it  is  definite  or  indefinite,  and  we  are  never  given  an  article  to  tell  us  which 
it  is.  However,  this  turns  out  to  be  identical  with  the  traditional  problem  of 
determining  whether  a  predication  is  given  or  new,  or  in  other  terminology, 
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is  part  of  the  presuppositions  of  the  sentence  or  part  of  what  is  asserted. 
Thus,  the  ontologically  promiscuous  notation,  rather  than  compounding  the 
definite-indefinite  problem,  collapses  it  and  the  given-new  problem  under  a 
single  treatment. 

Normatively,  the  main  verb  of  a  sentence  asserts  new  information  and 
grammatically  subordinated  material  is  given.  But  this  is  not  always  true. 
In 


The  philosophical  Greeks  contributed  much  to  civilization. 

it  is  unclear  whether  “philosophical”  is  intended  to  be  used  referentially  as 
given  information  (the  restrictive  cause)  or  is  another  new  assertion  being 
slipped  into  the  sentence  (the  nonrestrictive  case).  In 

An  innocent  man  was  hanged  today. 

it  could  be  that  the  speaker  and  hearer  both  know  a  man  was  hanged  today, 
and  the  speaker  is  asserting  his  innocence.  Where  there  is  an  adverbial,  as 
in 


John  saw  his  brother  recently. 

it  is  unclear  (without  intonation)  whether  the  seeing  or  the  recency  or  both 
is  being  asserted  as  new  information. 

A  heuristic  we  tried  initially  was  to  assume  that  everything  represented 
by  an  event  variable  (ei,  e2,. . .)  corresponds  to  new  information,  i.e.,  is  being 
asserted,  and  everything  else  is  definite  and  is  being  used  referentially.  This 
is  reasonably  accurate  in  the  casreps,  but  sentence  (6)  shows  that  it  is  not 
adequate  everywhere.  Consider  also  the  text 

The  low  lube  oil  alarm  sounded. 

The  alarm  was  activated  during  routine  start  of  start  air  com¬ 
pressor. 

One  can  argue  that  the  existence  of  an  activation  is  already  implicit  in  the 
sounding,  and  that  therefore  the  activation  is  given,  or  definite. 

The  real  story  is  that  it  is  part  of  the  job  of  pragmatics  to  determine 
whether  each  proposition  in  the  sentence  is  being  asserted  or  presupposed, 
and  whether  each  noun  phrase,  regardless  of  surface  form,  is  really  definite 
or  indefinite.  This  can  be  accomplished  by  means  of  referential  implicatures, 
which  is  our  current  method  for  handling  this  problem. 
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4.2  Referential  Implicatures 

Let  us  begin  with  the  simplest  case — clear  indefinites,  as  in 
A  blade  of  the  fan  was  chipped. 

We  cannot,  at  the  outset,  simply  assert  the  existence  of  a  B  such  that  B  is 
the  blade  of  the  fan,  for  we  have  not  yet  identified  the  fan.  If  we  followed  the 
naive  search  order  of  Section  3.2,  we  could  wait  until  the  fan  was  identified, 
assert  the  existence  of  one  of  its  blades,  and  proceed  to  interpret  the  rest 
of  the  sentence.  However,  in  the  sophisticated  search  order,  we  cannot  do 
this,  for  metonymy  problems  higher  up  in  a  logical  form,  say,  for  “chip”, 
may  need  to  be  solved  before  reference  problems  lower  down  can  be  solved, 
and  these  metonymy  problems  will  need  information  about  its  argument. 
Moreover,  several  fans  may  be  proposed  as  the  referent  of  “the  fan”,  and 
B  cannot  be  a  blade  of  all  of  them.  It  must  be  the  blade  of  the  fan  finally 
decided  upon. 

To  handle  this  problem,  as  we  process  the  sentence  in  the  routine  PRAG, 
we  temporarily  add  to  the  knowledge  base,  statements  asserting  the  exis¬ 
tence  of  the  indefinite  entities.  For  indefinites  at  the  bottom  of  the  logical 
form,  this  is  straightforward.  For 

A  metal  chip  was  found  in  the  sump. 

we  simply  assert 

(3y)metaf(y)  A  chip(y) 

For  indefinites  that  are  functionally  dependent  on  definites,  things  are  a 
little  more  complicated.  We  cannot  say 

(3  x,y)blade(x,y) 

for  there  would  be  no  guarantee  the  fan  finally  selected  would  be  that  y.  We 
cannot  say 

(Vy)(3  x)blade(x,y) 

for  certainly  not  everything  has  a  blade.  We  must  make  an  assertion  of  the 
form 


(Vy)/an(y)  D  (3 x)blade(x,y) 
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Think  of  this  as  saying,  for  any  way  that  you  can  resolve  “the  fan”,  there  is 
something  which  is  its  blade.  But  even  this  is  not  enough.  It  may  be  that  we 
know  about  some  fans  that  have  no  blades,  and  adding  this  assertion  would 
make  our  knowledge  base  inconsistent.  Thus,  we  need  something  more  like 
the  nonmonotonic  assertion 

(7)  (V  y)  f  an(y)  A  CONSISTENT\(3x)blade(x ,  y)] 

D  ( 3x)blade(x,y ) 

In  principle,  this  is  what  we  believe  is  correct.  The  procedure  CONSISTENT 
could  be  implemented  by  a  procedural  call  within  the  theorem  prover  to  the 
theorem  prover  itself.  But  of  course,  there  is  no  guarantee  it  will  terminate. 
So  in  practice,  our  present  strategy  is  simply  to  assume  consistency,  ignoring 
the  problem.  A  more  principled  approach  would  be  to  do  some  simple 
type-checking  for  inconsistencies,  and  if  none  are  found,  simply  to  assume 
consistency. 

We  may  call  assertions  like  (7)  “referential  implicatures”. 

Now  let  us  return  to  the  problem  of  Section  4.1,  that  it  is  impossible 
in  general  to  know  when  a  reference  is  definite  or  indefinite,  or  whether  a 
proposition  is  presupposed  or  asserted.  We  can  solve  this  problem  by  con¬ 
structing  referential  implicatures  for  every  entity  in  the  logical  form,  whether 
from  a  definite,  indefinite,  or  bare  noun  phrase,  or  a  nonnominal  reference. 
Of  course,  if  this  were  all  we  did,  every  sentence  would  be  easy  to  interpret 
and  the  interpretation  would  fail  to  tell  us  anything.  For  definite  references, 
especially,  we  do  not  want  to  use  the  referential  implicatures  unless  all  else 
fails.  To  accomplish  this,  we  associate  costs  with  the  various  referential 
implicatures.  Referential  implicatures  for  explicitly  indefinite  NPs  are  free. 
The  ones  for  explicitly  definite  NPs  are  quite  expensive.  Those  for  bare 
NPs  are  intermediate  between  the  two,  and  those  for  events,  introduced,  for 
example,  by  verb  phrases,  are  less  expensive  than  those  for  bare  NPs  but 
not  free.  These  costs  are  factored  into  the  cost  of  proofs  leading  to  inter¬ 
pretations,  so  that  interpretations  not  making  use  of  expensive  referential 
implicatures  are  cheaper  and  hence  better,  if  they  are  available.  Thus,  some¬ 
thing  is  taken  as  new  information  only  when  it  fails,  after  an  appropriate 
amount  of  processing,  to  be  recognized  as  given. 

4.3  Identity  Implicatures 

A  second  kind  of  implicature  that  would  be  necessary  in  this  kind  of  ap¬ 
proach  is  an  assumption,  for  no  other  reason  than  that  it  will  lead  to  a 
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good  interpretation  of  the  text,  that  two  entities  are  identical.  The  use  of 
such  implicatures  for  resolving  pronoun  references  was  discussed  in  Hobbs 
(1979).  Here  we  will  restrict  our  attention  to  their  use  in  resolving  nominal 
compounds. 

Let  us  consider  “oil  sample”  again.  Suppose  we  have  already  inferred  the 
existence  of  the  oil — oil(x).  Suppose  also  we  have  assumed  by  the  referential 
implicature  the  existence  of  a  sample  y  of  something  z — sample(y,z).  We 
need  to  prove  nn(x,y).  Axiom  (3)  tells  us  that  if  y  is  a  sample  of  x  then 
there  is  an  nn  relation  between  them.  The  only  thing  required  for  a  proof  is 
therefore  an  assumption  that  the  oil  y  and  the  implicit  second  argument  z  of 
sample  are  identical.  Since  this  would  lead  to  a  good  interpretation,  we  are 
tempted  to  do  this.  However,  we  would  like  to  check  for  consistency  first. 
When  we  do  some  simple  type  checking,  we  find  that  z,  since  it  can  have  a 
sample  taken  of  it,  must  be  a  material,  and  we  also  find  that  the  oil  x  is  a 
material.  This  does  not  prove  consistency,  but  it  provides  a  coincidence  of 
properties  that  at  least  makes  an  inconsistency  less  likely.  So  we  go  ahead 
and  make  the  identification.  A  problem  with  this  approach  is  that  it  is  not 
clear  how  the  drawing  of  identity  implicatures  can  be  triggered  or  controlled. 

Grice  (1975)  gave  the  name  “conversational  implicature”  to  an  assump¬ 
tion  one  had  to  make  simply  in  order  to  get  a  good  interpretation  of  a 
sentence.  Referential  implicatures  and  identity  implicatures  are  particularly 
elementary  and  widespread  cases  of  such  assumptions. 

4.4  Abduction  and  Redundancy 

We  are  currently  exploring  a  different  approach  to  this  whole  family  of 
problems — abductive  reasoning.  Pople  (1973)  and  Cox  and  Pietrzykowski 
(1986)  have  proposed  abductive  reasoning  as  a  means  for  diagnosis  in  expert 
systems.  Abductive  reasoning  is  reasoning  to  the  best  explanation.  If  we 
know  q(a )  and  we  know  (V  x)p(x)  D  g(z),  then  abductive  reasoning  leads  us 
to  conclude  p(a).  Intuitively,  p(a)  is  our  best  guess  for  why  the  observed  q(a) 
is  true.  The  problem  with  this  is  choosing  the  best  p(a)  among  a  conceiv¬ 
ably  large  set  of  possibilities.  Both  Pople  (1973)  and  Cox  and  Pietrzykowski 
(1986)  proposed  choosing  the  most  specific  unprovable  atom  as  the  best  ex¬ 
planation.  Thus,  an  abscess  in  the  liver  is  a  better  explanation  than  a  pain 
in  the  chest.  Stickel  (1987)  points  out  problems  with  this  and  argues  that 
often  in  natural  language  interpretation,  the  least  specific  unprovable  atom 
is  the  most  appropriate  one  to  be  assumed.  Thus,  if  “a  fluid”  is  mentioned, 
we  should  not  assume  it  is  lube  oil. 
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A  generalization  of  this  kind  of  abductive  capability  is  now  being  imple¬ 
mented  in  the  KADS  theorem  prover.  It  will  allow  us  to  recast  the  whole 
problem  of  definite  and  indefinite  reference.  The  interpretation  expression 
will  be  constructed  as  before.  Instead  of  referential  implicatures  being  as¬ 
serted  with  their  associated  costs,  the  same  costs  would  now  be  attached  to 
the  atoms  to  be  proved  as  the  cost  of  simply  assuming  them.  The  atoms 
will  be  assumed  with  their  most  specific  bindings,  which  will  perform  the 
function  of  including  the  antecedents  in  the  referential  implicatures.  There¬ 
fore,  if  a  definite  reference  is  resolvable  with  respect  to  the  knowledge  base, 
it  will  be  resolved  with  a  proof  considerably  cheaper  than  one  requiring  the 
assumption  of  the  existence  of  an  entity  of  that  description.  However,  if  it 
is  not  resolvable,  its  existence  will  be  assumed. 

This  approach  also  gives  us  a  way  of  dealing  with  examples  like 

Investigation  revealed  adequate  lube  oil  saturated  with  metal 
particles. 

Here,  “lube  oil”  is  given  information,  while  “adequate”  and  “saturated  with 
metal  particles”  are  new.  Under  the  abductive  approach  lube-oil(x)  will  be 
resolved  with  the  corresponding  atom  in  the  domain  model,  the  binding  will 
propagate  to  adequate(x')  and  satuTate(ps,  z),  and  these  instantiated  atoms 
will  then  be  assumed.  Solving  this  problem  using  referential  implicatures 
would  be  extremely  cumbersome. 

There  is  a  further  possible  benefit  from  the  abductive  approach;  it  may 
take  the  place  of  identity  implicatures  and  allow  us  at  last  to  exploit  the 
natural  redundancy  of  all  discourse.  An  example  can  illustrate  this  best. 
Consider  the  sentence 

Inspection  of  lube  oil  filter  revealed  metal  particles. 

There  are  several  coreference  problems  involving  implicit  arguments.  We 
would  like  to  be  able  to  discover  that  the  person  doing  the  inspection  was 
the  same  as  the  person  to  whom  the  particles  were  revealed,  and  we  would 
like  to  know  that  the  metal  particles  were  found  in  the  lube  oil  filter.  This  in¬ 
formation  is  not  explicit  in  the  sentence.  The  general  problem  is  to  discover 
the  coreference  relations  among  arguments  in  syntactically  independent  re¬ 
gions  of  a  sentence. 

Let  us  unpack  the  words  in  the  sentence  to  see  the  overlap  of  semantic 
content.  If  x  inspects  y,  then  x  looks  at  y  in  order  that  this  looking  will 
cause  x  to  learn  6ome  property  relevant  to  the  function  of  y.  In  order  to 
avoid  quantifying  over  predicates,  let  us  assume  an  analysis  of  location,  or 
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at ,  that  allows  properties  metaphorically  to  be  located  at  entities.  Then  we 
can  state  formally, 

(\/ei,x,y)inspect'(eux,y)  = 

(3  e2,e3,z,e4)Zoofc-at'(ei,:E,y)  A  cause(ei,e2) 

Alearn,(e2,x,e 3)  A  a/'(e3,z,y)  A  relevant-to(e3,  e4) 

A  function(e4,y) 

If  an  event  ej  reveals  z  to  x,  then  there  is  a  y  such  that  ej  causes  x  to 
learn  that  z  is  at  y.  Formally, 

(Vei  ,z,x)reveal(ei,z7x)  = 

(3  e2,e3,y)cau.se(ei,e2)  A  /earn'(e2,2 r,e3)  A  at'(e3,z,y) 

A  filter  is  something  whose  function  is  to  remove  particles.  Formally, 
(Ve6,y,w)filter'{e6,y,w)  = 

( 3  e4,z,s)function(e4,y )  A  remoue'(e4,  y,  z,  w)  A  particle(z) 

A  typical-element(z,  s ) 

If  y  removes  z  from  w,  then  there  is  a  change  from  z’s  being  in  w  to  z’s 
being  at  y. 

(Ve4,j/,z,u))reTnoue'(e4,y,z,to)  = 

(3  e8,e3)c/ianpe,(e4,e8,e3)  A  tV(e8,z,  w)  A  ai'(e3,z,y) 

Finally,  let  us  say  the  end  point  of  a  change  is  relevant  to  the  change. 

(Ve4,e8,e3)c/ianpe'(e4,e8,e3)  D  relevant-to(e3,e4 ) 

Now  the  interpretation  expression  will  include 

inspect'(ei , x\ ,  y)  A  reveal(e  1 ,  z, i2)  A  filter1  (e e,y,  w )  A particle(z) 

A  typical-element(z ,  5) 

If  the  above  axioms  are  used  to  expand  this  expression,  then  the  operation 
that  Stickel  calls  “factoring”  and  Cox  and  Pietrzykowski  call  “synthesis”  can 
apply;  we  can  unify  goal  atoms  wherever  possible.  We  can  thus  unify  the 
variables  as  indicated  in  the  way  we  have  named  them  in  the  axioms.  Further 
suppose  that  atoms  resulting  from  factoring  have  enhanced  assumability, 
since  they  will  lead  to  minimal  interpretations.  If  we  assume  those  atoms, 
then  we  will  have  concluded  that  the  inspector  ii  and  the  beneficiary  x2  of 
the  revealing  are  identical  and  that  the  particles  are  in  the  filter. 
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One  difficulty  with  is  approach  is  the  possible  inefficiency  introduced 
by  allowing  the  results  of  factoring  to  be  assumable-  Another  difficulty  is 
whether  the  bidirectional  implications  in  the  above  axioms  are  really  justi¬ 
fied,  and  how  the  procedure  could  be  made  to  work  if  we  only  had  implication 
to  the  right.  These  issues  are  under  investigation. 

5  Implementation 

In  our  implementation  of  the  TACITUS  system,  we  are  beginning  with  the 
minimal  approach  and  building  up  slowly.  As  we  implement  the  local  prag¬ 
matics  operations,  we  are  using  a  knowledge  base  containing  only  the  axioms 
that  are  needed  for  the  test  examples.  Thus,  it  grows  slowly  as  we  try  out 
more  and  more  texts.  As  we  gain  greater  confidence  in  the  pragmatics  op¬ 
erations,  we  move  more  and  more  of  the  axioms  from  our  commonsense  and 
domain  knowledge  bases  into  the  system’s  knowledge  base.  Our  initial  ver¬ 
sions  of  the  pragmatics  operations  are,  for  the  most  part,  fairly  standard 
techniques  recast  into  our  abstract  framework.  When  the  knowledge  base 
has  reached  a  significant  size,  we  will  begin  experimenting  with  more  general 
solutions  and  with  various  constraints  on  those  general  solutions. 

To  see  what  the  program  does,  let  us  examine  its  output  for  one  sentence. 


Tacitus>  operator  was  unable  to  maintain  lo  pressure  to  sac 


“Lo”  is  an  abbreviation  for  “lube  oil”  and  “sac”  is  an  abbreviation  for  “start¬ 
ing  air  compressor”.  The  sentence  is  parsed  and  six  parses  are  found.  Prepo¬ 
sitional  phrase  attachment  ambiguities  are  merged  to  reduce  the  number  of 
readings  to  four.  The  highest  ranking  parse  is  the  correct  one  because  the 
adjective  complement  interpretation  is  favored  over  the  purpose  clause  in¬ 
terpretation  for  infinitive  clauses,  and  because  the  attachment  of  “to  sac”  to 
“pressure”  is  favored  both  by  a  heuristic  that  favors  right  attachment  and 
one  that  favors  argument  prepositions  attached  to  their  relational  nouns. 
The  logical  form  is  produced  for  this  parse.  It  can  be  read  “In  the  past 
there  was  a  condition  E12  which  is  the  condition  of  XI  being  unable  to  do 
E3  where  E3  is  the  possible  event  of  XI,  who  is  the  operator,  maintaining  X4, 
which  is  the  pressure  of  something  Y1  at  XiO,  which  is  the  starting  air  com¬ 
pressor  (and,  by  the  way,  is  not  identical  to  X4),  and  there  is  some  implicit 
relation  NN  between  X6,  which  is  lube  oil,  and  X4. 
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OPERATOR  PAST1  BE  UNABLE  TO  MAINTAIN  LO  PRESSURE  TO  SAC 
six  parses  were  found 


After  merging  ambiguities,  there  are  four  logical  forms 
The  Highest  Ranking  LF : 

(E  (E13  E12  E2  X4  Ell  X10  Y1  E5  E7  X6  E8  E3  XI) 

(PAST!  E13 

(E12  (UNABLE!  E12  XI 

(E3  (MAINTAIN!  E3 

(XI  (OPERATOR!  E2  XI)) 

(X4  (PRESSURE!  E5  X4  Y1 

(XIO  (SAC!  Ell  XIO) 

(N0T=  XIO  (X4)))) 

(NN!  E8  (X6  (LUBE- OIL!  E7  X6)) 
X4) ))))))) 


The  sentence  is  interpreted  from  the  inside  out,  so  the  first  problem  is 
finding  the  reference  of  “operator”.  “BARE”  means  there  is  no  determiner. 


Reference  Problem:  XI:  treated  as  type  BARE 

II 

Prove:  (E  (xl  e2) 

(Operator!  e2  xl)) 


II. V 


The  reference  is  resolved  by  unifying  xl  with  the  constant  oprl  in  the  axioms 
that  encode  the  domain  model,  oprl  has  the  property  Operator. 


Reference  Resolved: 
xl  =  oprl 


This  was  established  by  inferring  the  following  proposition  from  the  axioms, 
operator-nessl  is  the  condition  of  oprl’s  having  the  property  Operator. 
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Inferred  the  following  propositions: 

(Operator!  operator-nessl  oprl) 


The  next  problem  is  the  reference  of  “sac”.  We  do  not  use  the  non¬ 
coreference  information  encoded  by  Not=  at  the  present  time.  It  is  always 
assumed  to  be  true.  The  reference  is  resolved  by  identifying  the  sac  as  the 
one  mentioned  in  the  domain  model. 


Reference  Problem:  X10:  treated  as  type  BARE 

IIII 

Prove:  (E  (xlO  ell  x4) 

(AND  (Not=  xlO  cons (x4 .nil) ) 

(Sac!  ell  xlO))) 


ID*  I  .  W 

Reference  Resolved: 
xlO  =  sacl 

Inferred  the  following  propositions: 

(Not=  sacl  cons(X195,nil)) 
(Sac!  sac-nessl  sacl) 


The  next  problem,  moving  from  the  inside  out,  is  to  satisfy  the  con¬ 
straints  the  word  “pressure”  places  on  its  arguments.  A  coercion  constant 
k3,  which  is  related  to  the  entity  sacl  that  we  have  already  resolved  XlO 
to,  is  introduced  to  take  care  of  the  possibility  of  metonymy.  The  word 
“pressure”  requires  that  yl  must  be  a  fluid  that  can  be  located  at  k3. 


Metonymy  Problem: 

(PRESSURE!  E5  X4  Yl  XlO) 

III  II II I 

Prove:  (E  (k3  yl  k5  k4  x4) 

(AND  (Not=  sacl  cons (x4 .nil)) 
(Fluid!  k4  yl) 

(At!  k5  yl  k3) 
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(Related  k3  sacl))) 


The  stars  ajid  bars  tell  the  user  that  the  theorem  prover  is  working  away. 


ID* |***|* |* |***| *| .t.* 


One  way  of  being  related  is  being  a  part  of,  and  the  bearings  are  a  part  of 
the  sac,  and  the  only  fluid  that  the  system  currently  knows  about  that  can 
be  at  something  related  to  the  sac  is  the  lube  oil.  So  it  is  determined  that 
it  must  be  the  pressure  of  the  lube  oil  at  the  bearings,  which  are  a  part  of 
the  sac.  Had  the  system  also  known  about  air,  it  could  have  come  up  with  a 
different  interpretation.  This  is  an  example  where  the  compound  nominal, 
and  thus  the  reference,  problem  for  “pressure”  should  have  been  done  at  the 
same  time,  and  where  exploiting  the  redundancy  of  information  encoded  in 
the  words  “lube  oil”  and  “pressure”  would  have  helped. 

The  instantiated  inference  steps  are  listed.  Lube  oil  is  known  to  be  a 
fluid  because  oil  is  and  lube  oil  is  oil.  It  is  known  to  be  at  the  bearings 
because  it  is  known  that  the  pump  transmits  lube  oil  from  the  pump  to  the 
bearings,  and  the  being  located  is  the  end  state  of  that  transmission.  The 
bearings  are  a  part  of  the  sac  because  they  are  a  part  of  the  lube  oil  system, 
which  is  a  part  of  the  sac. 


Metonymy  Resolved: 
yl  =  lube-oill 
xlO  =  sacl 
k3  -  bearingsl 

Inferred  the  following  propositions: 

(Partof  bearingsl  sacl) 

(Not=  sacl  cons (X206, nil)) 

(Fluid!  k4  lube-oill) 

(Oil!  oil-ness-ll(_)  lube-oill) 

(Lube-Oil!  lube-oil-nessl  lube-oill) 

(At!  k5  lube-oill  bearingsl) 

(Transmit!  transmit-ness2  pumpl  lube-oill  pumpl 
bearingsl) 

(Related  bearingsl  sacl) 
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(Component!  component -nes si  losysl  sacl) 
(Component!  component-ness3  bearingsl  losysl) 
(Partof  losysl  sacl) 


The  fact  that  there  has  been  a  coercion  is  reported  to  the  user. 


Coercion:  (Pressure!  e5  x4  yl  k3) 


Next  is  the  reference  problem  for  “lube  oil”,  which  is  solved  in  the  same 
way  as  the  two  previous  reference  problems. 


Reference  Problem:  X6:  treated  as  type  BARE 
11*111*1 

Prove:  (E  (x6  e7) 

(Lube-Oil!  e7  *6)) 


I|  .W 

Reference  Resolved: 
x6  -  lube-oill 

Inferred  the  following  propositions: 

(Lube-Oil!  lube-oil-nessl  lube-oill) 


The  reference  problem  for  “pressure”  is  addressed  with  its  arguments 
instantiated  with  the  values  that  have  already  been  discovered.  If  this  were 
inconsistent,  the  system  would  back  up,  and  try  to  prove  the  fail-soft  ver¬ 
sion  of  the  interpretation  expression  described  in  Section  3.2.  The  compound 
nominal  interpretation  problem  is  dealt  with  here  as  well.  It  is  solved  be¬ 
cause  the  relational  noun  -  argument  relation  is  one  possible  way  for  Nn  to 
be  true. 


Reference  Problem:  X4:  treated  as  type  BARE 
111111*111 

Prove:  (E  (x4  e5  e8) 
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(AND  (Nn!  e8  lube-oill  x4) 

(Pressure!  e5  x4  lube-oil 1  bearingsl))) 

I | *** | ********** ] | [ . | i | * |  j 

Reference  Resolved: 
x4  =  pressurel 
x6  =  lube-oill 
k3  =  bearingsl 
yl  =  lube-oill 

Inferred  the  following  propositions: 

(Nn!  e8  lube-oill  pressurel) 

(Pressure!  pressure-nessl  pressurel  lube-oill 
bearingsl) 


The  metonymy  problem  for  the  predicate  MAINTAIN  is  handled  next.  For 
something  to  be  maintained,  it  must  be  an  eventuality  that  is  desired  by  the 
maintainer.  The  adequacy  of  the  lube  oil  pressure,  being  a  normal  condition, 
is  desired  by  the  operator.  Hence,  “maintain  lube  oil  pressure”  is  coerced 
into  “maintain  the  adequacy  of  lube  oil  pressure”. 


Metonymy  Problem:  (MAINTAIN!  E3  XI  X4) 
1 1 1 ID  I | | ID* | 

Prove:  (E  (klO  kll  kl2) 

(AND  (Eventuality  kll) 

(Desire!  kl2  klO  kll) 
(Related  kll  pressurel) 
(Related  klO  oprl))) 


ID* 1 *** | * | .T.* 

Metonymy  Resolved: 
x4  =  pressurel 
kll  ~  adequate-nessl 
xl  =  oprl 
klO  =  oprl 
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Inferred  the  following  propositions: 

(Pressure!  pressure-nessl  pressurel  lube-oill 
bearingsl) 

(Adequate!  adequate-nessl  pressurel) 

(Related  oprl  oprl) 

(Desire!  kl2  oprl  adequate-nessl) 

(Normal  adequate-nessl) 

(Related  adequate-nessl  pressurel) 

Coercion:  (Maintain!  e3  oprl  kll) 


The  system  also  tries  to  solve  nonnominal  reference  problems.  Here  it 
seeks  to  determine  if  it  already  knows  about  a  maintaining  event.  It  does 
not,  so  a  referential  implicature  introduces  it  as  a  new  entity. 


Reference  Problem:  E3:  treated  as  type  EVENT 
1 1*  HD*  | 

Prove:  (E  (e3) 

(Maintain!  e3  oprl  adequate-nessl)) 

II.* 

New  Entity  Introduced: 

E3 


The  constraint  UNABLE  places  on  its  arguments  is  that  E3  must  be  an 
eventuality.  This  is  verified.  A  possible  coercion  is  assumed  by  introducing 
the  coercion  constant  kl5,  but  identity  is  one  way  of  being  coerced. 


Metonymy  Problem:  (UNABLE!  E12  XI  E3) 

IID I ID* | 

Prove:  (E  (kl5) 

(AND  (Eventuality  kl5) 

(Related  kl5  maintain-ness-72) ) ) 


ID* I . * . ** 
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Metonymy  Resolved: 
e3  =  maintain-ness-72 
kl5  =  maintain-ness-72 

Inferred  the  following  propositions: 

(Related  e3  e3) 

Nonnominal  reference  is  determined  for  the  inability  as  well,  and  it  is 
determined  to  be  new. 

Reference  Problem:  E12:  treated  as  type  EVENT 
I | * | ID* 1 
Prove:  (E  (el2) 

(Unable!  e!2  oprl  maintain-ness-72)) 


II  .* 

New  Entity  Introduced: 
E12 


1=1*1 


This  completes  the  interpretation  of  the  sentence.  All  of  the  properties 
that  have  been  inferred  are  listed.  Those  properties  that  required  referential 
implicatures  are  new  information  and  are  listed  as  such. 


INTERPRETATION  OF  SENTENCE: 


New  Information: 

el3 :  (Past!  e!3  el2) 

ei2:  (Unable!  el2  oprl  e3) 

e3:  (Maintain!  e3  oprl  adequate-nessl) 


Old  Information: 
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oprl : 

adequate-nessl : 

pressurel : 

lube-oill : 


bearingsl : 

losysl : 

sacl: 
pumpl : 


(Operator!  operator-nessl  oprl) 

(Desire!  kl2  oprl  adequate-nessl) 
(Adequate!  adequate-nessl  pressurel) 
(Normal  adequate-nessl) 

(Related  adequate-nessl  pressurel) 
(Pressure!  pressure-nessl  pressurel 
lube-oill  bearingsl) 

(Nn!  e8  lube-oill  pressurel) 

(Fluid!  k4  lube-oill) 

(Oil!  oil-ness-11  lube-oill) 

(Lube-Oil!  lube-oil-nessl  lube-oill) 

(At!  k5  lube-oill  bearingsl) 

(Transmit!  transmit-ness2  pumpl  lube-oill 
pumpl  bearingsl) 

(Component!  component-ness3  bearingsl 
losysl) 

(Related  bearingsl  sacl) 

(Partof  bearingsl  sacl) 

(Partof  losysl  sacl) 

(Component!  component-nessl  losysl  sacl) 
(Sac!  sac-nessl  sacl) 


I=I=I=I=I=I=I=I=1=I=I=I-DDD II  I  I  I  I  I  II  I  I  I 


The  interpretation  of  the  sentence  makes  no  assumptions  about  the  exis¬ 
tential  status  of  the  various  eventualities  conveyed  by  the  sentence.  This  is 
done  in  a  final  phase  of  processing.  The  highest  level  eventuality  is  assumed 
to  exist,  and  decisions  are  propagated  down  from  there.  Thus,  since  the 
past-ness  exists,  the  inability  exists.  Since  the  inability  exists,  the  main¬ 
taining  does  not  exist.  Since  it  does  not  exist,  neither  does  the  adequacy. 
That  is  all  that  can  be  concluded  for  sure.  Simply  as  a  heuristic,  the  other 
eventualities  are  assumed  to  exist. 


Assuming  the  following  eventualities  do  exist: 

E12 ,  E13,  E8 ,  K12 ,  K4,  K5,  LUBE-0 IL-NESS 1 , 
0PERAT0R-NESS1 ,  PRESSURE-NESS1 ,  SAC-NESS1 
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Assuming  the  following  eventualities  do  not  exist: 
ADEqUATE-NESSl,  E3 


Acknowledgements 

The  authors  have  profited  from  discussions  with  Mark  Stickel,  Doug  Ed¬ 
wards,  Mabry  Tyson,  Bill  Croft,  Fernando  Pereira,  Ray  Perrault,  and  Stu 

Shieber  about  this  work.  The  research  was  funded  by  the  Defense  Advanced 

Research  Projects  Agency  under  Office  of  Naval  Research  contract  N00014- 
85-C-O013. 

References 

[1]  Cox,  P.  T.,  and  T.  Pietrzykowski,  1986.  “Causes  for  Events:  Their  Com¬ 
putation  and  Applications”,  Proceedings,  CADE-8,  pp.  608-621. 

[2]  Downing,  Pamela,  1977.  “On  the  Creation  and  Use  of  English  Compound 
Nouns”,  Language,  vol.  53,  no.  4,  pp.  810-842. 

[3]  Finin,  Timothy,  1980.  “The  Semantic  Interpretation  of  Nominal  Com¬ 
pounds”,  Report  T-96,  Coordinated  Science  Laboratory,  University  of 
Illinois,  Urbana,  Illinois,  June  1980. 

[4]  Grice,  H.  P.,  1975.  “Logic  and  Conversation”,  in  P.  Cole  and  J.  Morgan, 
eds.,  Syntax  and  Semantics,  vol.  3,  pp.  41-58,  Academic  Press,  New  York. 

[5]  Grosz,  Barbara,  Norman  Haas,  Gary  Hendrix,  Jerry  Hobbs,  Paul  Martin, 
Robert  Moore,  Jane  Robinson,  Stanley  Rosenschein,  1982.  “DIALOGIC: 
A  Core  Natural- Language  Processing  System”,  Technical  Note  270,  Arti¬ 
ficial  Intelligence  Center,  SRI  International. 

[6]  Grosz,  Barbara  J.,  Douglas  E.  Appelt,  Paul  Martin,  Fernando  C.  N. 
Pereira  and  Lorna  Shinkle,  1985.  “The  TEAM  Natural-Language  Interface 
System”,  Final  Report,  Project  4865,  Artificial  Intelligence  Center,  SRI 
International,  Menlo  Park,  California. 

[7]  Hobbs,  Jerry  R.,  1979.  “Coherence  and  Coreference”,  Cognitive  Science, 
vol.  3,  no.  1,  pp.  67-90. 


28 


{8]  Hobbs,  Jerry  R.,  1983.  “An  Improper  Treatment  of  Quantification  in 
Ordinary  English”,  Proceedings,  21st  Annual  Meeting  of  the  Association 
for  Computational  Linguistics ,  Cambridge,  Massachusetts,  pp.  57-63. 

[9]  Hobbs,  Jerry  R.,  1985.  “Ontological  Promiscuity”,  Proceedings,  23rd  An¬ 
nual  Meeting  of  the  Association  for  Computational  Linguistics,  Chicago, 
Illinois,  pp.  61-69. 

[10]  Hobbs,  Jerry  R-,  1985.  “Implicature  and  Definite  Reference”,  Report 
No.  CSLI-87-99,  Center  for  the  Study  of  Language  and  Information,  Stan¬ 
ford  University,  Stanford,  California,  May  1987. 

[11]  Hobbs,  Jerry  R.,  William  Croft,  Todd  Davies,  Douglas  Edwards,  and 
Kenneth  Laws,  1986.  “Commonsense  Metaphysics  and  Lexical  Seman¬ 
tics”,  Proceedings,  24th  Annual  Meeting  of  the  Association  for  Computa¬ 
tional  Linguistics ,  New  York,  June  1986.,  pp.  231-240. 

[12]  Levi,  Judith,  1978.  The  Syntax  and  Semantics  of  Complex  Nominals, 
Academic  Press,  New  York. 

[13]  Moore,  Robert  C.,  1981.  “Problems  in  Logical  Form”,  Proceedings,  19th 
Annual  Meeting  of  the  Association  for  Computational  Linguistics,  Stan¬ 
ford,  California,  pp.  117-124. 

[14]  Nunberg,  Geoffery,  1978.  “The  Pragmatics  of  Reference”,  Ph.  D.  thesis, 
City  University  of  New  York,  New  York. 

[15]  Pople,  Harry  E.,  1973.  “On  the  Mechanization  of  Abductive  Logic”, 
Proceedings,  International  Joint  Conference  on  Artificial  Intelligence, 
Stanford,  California,  August  1973,  pp.  147-152. 

[16]  Stickei,  Mark  E.,  1982.  “A  Nonclausal  Connection-Graph  Theorem- 
Proving  Program”,  Proceedings,  AAAI-82  National  Conference  on  Arti¬ 
ficial  Intelligence ,  Pittsburgh,  Pennsylvania,  pp.  229-233. 

[17]  Stickei,  Mark  E.,  1987.  “Pragmatics  as  Abduction:  Least-Specific  Ab¬ 
duction  and  its  Use  in  Natural-Language  Interpretation”,  manuscript. 

[18]  Woods,  William,  1977.  “Semantics  and  Quantification  in  Natural  Lan¬ 
guage  Question  Answering”,  Advances  in  Computers,  Volume  17,  Aca¬ 
demic  Press,  New  York,  pp.  1-87. 


29 


